Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
3439 | 4469 | 1,5 |
6061 | 2294 | 1,2 |
6130 | 2257 | 2,5 |
7663 | 1680 | 1,3 |
7693 | 1671 | 3,5 |
8956 | 1373 | 1,7 |
9119 | 1340 | 1,4 |
9413 | 1284 | 1,6 |
9569 | 1255 | 1,8 |
9945 | 1187 | 1,1 |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
197773 | 8 | .) |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
1853 | 8679 | 100% |
2213 | 7238 | 50% |
2461 | 6437 | 10% |
2626 | 6064 | 30% |
2687 | 5908 | 20% |
3061 | 5086 | 40% |
3122 | 4980 | 5% |
3331 | 4641 | 70% |
3426 | 4493 | 80% |
3772 | 3988 | 90% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
12281 | 892 | S&P |
35302 | 177 | R&B |
35825 | 173 | S&P/ASX |
42166 | 131 | P&D |
42321 | 130 | C&A |
43433 | 125 | J&F |
50743 | 96 | AT&T |
53526 | 88 | M&A |
60934 | 70 | A&M |
71009 | 54 | P&G |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
22772 | 358 | R$1 |
27023 | 275 | R$10 |
29984 | 232 | R$1,5 |
30655 | 224 | R$100 |
35072 | 179 | R$20 |
35412 | 176 | R$50 |
36985 | 164 | R$ 20,00 |
38775 | 151 | R$5 |
40036 | 143 | R$2 |
40338 | 141 | R$500 |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
513 | 28576 | ." |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
17903 | 513 | d'água |
26310 | 287 | McDonald's |
33575 | 192 | .' |
42508 | 129 | D'Ávila |
51387 | 94 | D'Avila |
52892 | 90 | d'Avila |
53904 | 87 | Sant'Anna |
54540 | 85 | D'Or |
54552 | 85 | Don't |
62126 | 68 | L'Equipe |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
34946 | 180 | Apple TV+ |
61133 | 70 | The Voice + |
83195 | 41 | O+Positivo |
107139 | 26 | LGBTQIA+” |
112343 | 24 | Rio+Saneamento |
113793 | 23 | 90+3 |
137769 | 17 | so+ma |
143134 | 15 | 90+2 |
148496 | 14 | 90+1 |
151414 | 14 | Sumol+Compal |

Words containing *:

Rank in Wordlist | Frequency | Word |
---|---|---|
76066 | 48 | Sagitário A* |
363698 | 3 | Sagittarius A* |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
4473 | 3268 | e/ou |
6024 | 2314 | https://www |
7227 | 1826 | km/h |
8398 | 1492 | Manaus/AM |
14129 | 727 | 2022/23 |
14479 | 703 | 2021/22 |
14702 | 688 | https://t |
15113 | 659 | 2022/2023 |
16007 | 603 | 2021/2022 |
16278 | 589 | https://bit |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others will not be. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
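The check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `suspicious_words`, the per-language set `allowed_chars`, and the threshold `min_freq=10` are hypothetical and not part of the report; what counts as "very low frequency" has to be chosen per corpus.

```python
# Special characters examined in this subsection.
SPECIAL_CHARS = set(",()%&$\"'+*=/_")


def suspicious_words(entries, allowed_chars, min_freq=10):
    """Return (word, freq, bad_chars) for words that contain a forbidden
    special character and are not very rare.

    entries:       iterable of (word, freq) pairs from the word list
    allowed_chars: special characters accepted inside words for this
                   language (assumption: supplied by the user)
    min_freq:      hypothetical cut-off for "very low frequency"
    """
    forbidden = SPECIAL_CHARS - set(allowed_chars)
    hits = []
    for word, freq in entries:
        bad = sorted(forbidden.intersection(word))
        if bad and freq >= min_freq:
            hits.append((word, freq, bad))
    return hits


if __name__ == "__main__":
    # A few (word, frequency) pairs taken from the tables in this subsection.
    sample = [("d'água", 513), (".)", 8), ("90+3", 23), ("e/ou", 3268)]
    # Assumption for the demo: apostrophe and slash are allowed in words.
    for hit in suspicious_words(sample, allowed_chars="'/"):
        print(hit)
```

With these assumed settings only `90+3` is reported: `d'água` and `e/ou` contain allowed characters only, and `.)` falls below the cut-off.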
Words containing %:
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;
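The same query can be repeated for each character in the list. Two of them, % and _, are wildcards in LIKE patterns and must be escaped, as the query above does for %. The following is a minimal sketch that runs the query against an in-memory SQLite table instead of the original database; the table layout (`w_id`, `word`, `freq`) and the offset of 100 are taken from the query above, while the helper names `like_pattern`, `words_containing`, and `demo_connection` are hypothetical. SQLite needs an explicit `ESCAPE` clause, which is an adaptation for this sketch.

```python
import sqlite3

SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch):
    """Build a LIKE pattern that matches words containing ch literally."""
    # % and _ are LIKE wildcards; backslash is used as the escape character.
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch
    return "%" + ch + "%"


def words_containing(conn, ch, limit=10):
    """Return up to `limit` rows (rank, freq, word) for words containing ch."""
    sql = (
        "SELECT w_id - 100 AS word_rank, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


def demo_connection():
    """In-memory stand-in for the word list (assumed schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)"
    )
    # Rows taken from the tables in this subsection (w_id = rank + 100).
    rows = [
        (1953, "100%", 8679),
        (3539, "1,5", 4469),
        (4573, "e/ou", 3268),
        (12381, "S&P", 892),
    ]
    conn.executemany("INSERT INTO words VALUES (?, ?, ?)", rows)
    return conn


if __name__ == "__main__":
    conn = demo_connection()
    for ch in SPECIAL_CHARS:
        print(ch, words_containing(conn, ch))
```

Passing the pattern as a bound parameter also avoids quoting problems for the characters " and ', which would otherwise have to be escaped inside the SQL string literal.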